SEQUENCE LISTING



(1) GENERAL INFORMATION 



(i) APPLICANT: Lal, Preeti

Corley, Neil C.



(ii) TITLE OF THE INVENTION: HUMAN SHORT CHAIN DEHYDROGENASE 

(iii) NUMBER OF SEQUENCES: 3 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Dr. 

(C) CITY: Palo Alto

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Filed Herewith 

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE/DOCKET NUMBER: PF-0475 US

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650-855-0555 

(B) TELEFAX: 650-845-4166 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: PROSNOT01

(B) CLONE: 356351

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Met Ala Ala Pro Met Asn Gly Gln Val Cys Val Val Thr Gly Ala Ser
1               5                   10                  15

Arg Gly Ile Gly Arg Gly Ile Ala Leu Gln Leu Cys Lys Ala Gly Ala
            20                  25                  30

Thr Val Tyr Ile Thr Gly Arg His Leu Asp Thr Leu Arg Val Val Ala
        35                  40                  45

Gln Glu Ala Gln Ser Leu Gly Gly Gln Cys Val Pro Val Val Cys Asp
    50                  55                  60

Ser Ser Gln Glu Ser Glu Val Arg Thr Leu Phe Glu Gln Val Asp Arg
65                  70                  75                  80

Glu Gln Gln Gly Arg Leu Asp Val Leu Val Asn Asn Ala Tyr Ala Gly
                85                  90                  95

Val Gln Thr Ile Leu Asn Thr Arg Asn Lys Ala Phe Trp Glu Thr Pro
            100                 105                 110

Ala Ser Met Trp Asp Asp Ile Asn Asn Val Gly Leu Arg Gly His Tyr
        115                 120                 125

Phe Cys Ser Val Tyr Gly Ala Arg Leu Met Val Pro Ala Gly Gln Gly
    130                 135                 140

Leu Ile Val Val Ile Ser Ser Pro Gly Ser Leu Gln Tyr Met Phe Asn
145                 150                 155                 160

Val Pro Tyr Gly Val Gly Lys Ala Ala Cys Asp Lys Leu Ala Ala Asp
                165                 170                 175

Cys Ala His Glu Leu Arg Arg His Gly Val Ser Cys Val Ser Leu Trp
            180                 185                 190

Pro Gly Ile Val Gln Thr Glu Leu Leu Lys Glu His Met Ala Lys Glu
        195                 200                 205

Glu Val Leu Gln Asp Pro Val Leu Lys Gln Phe Lys Ser Ala Phe Ser
    210                 215                 220

Ser Ala Glu Thr Thr Glu Leu Ser Gly Lys Cys Val Val Ala Leu Ala
225                 230                 235                 240

Thr Asp Pro Asn Ile Leu Ser Leu Ser Gly Lys Val Leu Pro Ser Cys
                245                 250                 255

Asp Leu Ala Arg Arg Tyr Gly Leu Arg Asp Val Asp Gly Arg Pro Val
            260                 265                 270

Gln Asp Tyr Leu Ser Leu Ser Ser Val Leu Ser His Val Ser Gly Leu
        275                 280                 285

Gly Trp Leu Ala Ser Tyr Leu Pro Ser Phe Leu Arg Val Pro Lys Trp
    290                 295                 300

Ile Ile Ala Leu Tyr Thr Ser Lys Phe
305                 310

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1387 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: PROSNOT01 

(B) CLONE: 356351 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

CTAACTTTGG CCTGGGACTC TGCCCCTCTA CCTCAGCACA GAATCGCCCC GGGTCCTACT 60 

ACAGAATCAA TCCTTGAACA CTGCCTCCAC GTCGCCGGCT CAATCTGGGC GAGAACCCAG 120

ACTTCCACCG CAGCCCCGCA ATCTGCAGAC CTCAGCGGCA GCGCAGGTGG CAGACCTGCC 180 

TCCTTTGCCT GTGAGTCATG GCAGCTCCCA TGAATGGCCA AGTGTGTGTG GTGACTGGTG 240

CCTCCAGGGG TATTGGCCGT GGCATTGCCT TGCAGCTCTG CAAAGCAGGC GCCACAGTTT 300

ACATCACTGG CCGCCATCTG GACACCCTTC GCGTTGTTGC TCAGGAGGCA CAATCCCTCG 360

GGGGCCAATG TGTGCCTGTG GTGTGCGATT CAAGCCAGGA GAGTGAAGTG CGAACGCTGT 420

TTGAGCAAGT GGATCGGGAA CAGCAAGGGC GTCTAGATGT GCTGGTCAAC AATGCTTATG 480 

CAGGGGTCCA GACGATCCTG AACACCAGGA ATAAGGCATT CTGGGAAACC CCTGCCTCCA 540

TGTGGGATGA TATCAACAAC GTCGGACTCA GAGGCCACTA CTTTTGCTCA GTGTATGGGG 600 

CACGGCTGAT GGTACCAGCT GGCCAGGGGC TCATCGTGGT CATCTCCTCC CCAGGAAGCC 660 

TGCAGTATAT GTTCAATGTC CCCTATGGTG TGGGCAAAGC TGCGTGTGAC AAGCTGGCTG 720

CTGACTGTGC CCACGAGCTG CGGCGCCATG GGGTCAGCTG TGTGTCTCTG TGGCCGGGGA 780

TTGTGCAGAC AGAACTGCTG AAGGAGCATA TGGCAAAGGA GGAGGTCCTG CAGGATCCTG 840 

TGTTGAAGCA GTTCAAATCA GCCTTCTCAT CTGCAGAAAC CACAGAATTG AGTGGCAAAT 900 

GTGTGGTGGC TTTGGCAACA GATCCCAATA TCCTGAGCCT GAGTGGTAAG GTGCTGCCAT 960

CCTGTGACCT TGCTCGACGC TATGGCCTTC GGGATGTGGA CGGCCGCCCC GTCCAAGACT 1020

ATTTGTCTTT GAGCTCTGTT CTCTCACACG TGTCCGGCCT GGGCTGGCTG GCCTCCTACC 1080 

TGCCCTCCTT CCTCCGTGTG CCCAAGTGGA TTATTGCCCT CTACACTAGC AAGTTCTAAC 1140

CCTCCTGGTC TGACACTACG TCTCTGCTTG TCTTCTCATT TGGACTTGGT GGTTCGTCCT 1200

GTCTCAGTGA AACAGCAGCC TTTCTTGTTT ACCCATACCC TTGATATGAA GAGAAGCCCT 1260

CTGCTGTGTG TCCGTGGTGA GTTCTGGGGT GCGCCTAGGT CCCTTCTTTG TGCCTTGGTT 1320

TTCCTTGTCC TTCTTTTTAC TTTTTGCCTT AGTATTGAAA AATGCTCTTG GAGCTAATAA 1380

AAGTCTA 1387

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 323 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: GenBank

(B) CLONE: 2315796

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Met Gly Val Ile Leu Gln Asp Gln Val Ala Leu Val Thr Gly Ala Ser
1               5                   10                  15

Arg Gly Ile Gly Arg Gly Ile Ala Leu Gln Leu Gly Glu Ala Gly Ala
            20                  25                  30

Thr Val Tyr Ile Thr Gly Arg Arg Pro Glu Leu Ser Asp Asn Phe Arg
        35                  40                  45

Leu Gly Leu Pro Ser Leu Asp Tyr Val Ala Lys Glu Ile Thr Ser Arg
    50                  55                  60

Gly Gly Lys Gly Ile Ala Leu Tyr Val Asp His Ser Asn Met Thr Glu
65                  70                  75                  80

Val Lys Phe Leu Phe Glu Lys Ile Lys Glu Asp Glu Glu Gly Lys Leu
                85                  90                  95

Asp Ile Leu Val Asn Asn Val Tyr Asn Ser Leu Gly Lys Ala Thr Glu
            100                 105                 110

Met Ile Gly Lys Thr Phe Phe Asp Gln Asp Pro Ser Phe Trp Asp Asp
        115                 120                 125

Ile Asn Gly Val Gly Leu Arg Asn His Tyr Tyr Cys Ser Val Tyr Ala
    130                 135                 140

Ala Arg Met Met Val Glu Arg Arg Lys Gly Leu Ile Val Asn Val Gly
145                 150                 155                 160

Ser Leu Gly Gly Leu Lys Tyr Val Phe Asn Val Ala Tyr Gly Ala Gly
                165                 170                 175

Lys Glu Ala Leu Ala Arg Met Ser Thr Asp Met Ala Val Glu Leu Asn
            180                 185                 190

Pro Tyr Asn Val Cys Val Val Thr Leu Ile Pro Gly Pro Val Lys Thr
        195                 200                 205

Glu Thr Ala Asn Arg Thr Ile Ile Asp Asp Ala Tyr Lys Met Ile Lys
    210                 215                 220

Glu Asn Pro Glu Leu Glu Glu Phe Ile Lys Gly Glu Ser Thr Glu Tyr
225                 230                 235                 240

Thr Gly Lys Ala Leu Ala Arg Leu Ala Met Asp Pro Gly Lys Leu Lys
                245                 250                 255

Lys Ser Gly Lys Thr Leu Phe Thr Glu Asp Leu Ala Gln Lys Tyr Asp
            260                 265                 270

Phe Ser Asp Lys His Gly Ala Gly Met Glu Pro Gln Asn Ile Arg Ser
        275                 280                 285

Ile Arg Thr Ile Leu Gly Thr Met Gly Lys Glu Glu Val Ala Lys Tyr
    290                 295                 300

Ile Pro Pro Gln Ile Lys Leu Pro Lys Trp Val Ile Trp Gln Ser Val
305                 310                 315                 320

Asn Arg Phe